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Abstract 

The expansion of tools against HIV transmission has 
brought increased interest in epidemiological models 
that can predict the impact of these interventions. 
The EMOD-HIV model was recently compared to 
eleven other independently developed mathematical 
models of HIV transmission to determine the extent 
to which they agree about the potential impact of ex- 
panded use of antiretroviral therapy in South Africa. 
Here we describe in detail the modeling methodology 
used to produce the results in this comparison, which 
we term EMOD-HIV wO.7. We include a discussion 
of the structure and a full list of model parameters. 
We also discuss the architecture of the model, and its 
potential utility in comparing structural assumptions 
within a single modeling framework. 

1 Introduction 

This document provides a detailed description of the 
"EMOD-HIV" population model of HIV that was 
compared to several other HIV models in a recent ar- 
ticle by Eaton et al. published in PLoS Medicine [1]. 
The EMOD-HIV model was designed by researchers 
in the Epidemiological Modeling group at Intellectual 
Ventures Laboratory as part of a larger effort to en- 
able broad accessibility of modeling and quantitative 
analysis tools for planning and execution of infectious 
disease control. The group has thus far developed 
mathematical models of malaria 2], polio, and tuber- 
culosis in addition to HIV. The models and their ac- 
companying software infrastructure are designed pri- 
marily for use by disease modelers, researchers, epi- 
demiologists, and public health professionals seeking 
to simulate infectious disease conditions and evaluate 
the effectiveness of different eradication or mitigation 
approaches. 

This document describes only the HIV transmis- 
sion model, and not the other diseases, parallel com- 
puting infrastructure, or additional features of the 



software not utilized in the Eaton 2012 model com- 
parison. 

The Eaton 2012 model resuhs from EMOD were 
generated from a preliminary version of the model, 
which we will denote vQ.7. While vO.7 includes all the 
basic elements of a HIV simulation, to be discussed 
shortly, it lacks some elements that could have added 
heterogeneity to behavior and biology of HIV trans- 
mission, such as variable viral loads and disease stage 
durations. Additionally, the model parameters were 
calibrated using a rudimentary search algorithm that 
did not allow for parameter uncertainty to be incor- 
porated in error bound calculations. More detailed 
and systematically calibrated versions of the model 
are in progress. Outputs from vO.7 should be consid- 
ered preliminary. 

2 Overview of the Model 

EMOD-HIV vO.7 is an individual-based stochastic 
simulation of sexual and vertical HIV transmission 
implemented in C-t--l- and parameterized for the set- 
ting of South Africa. More specifically, vO.7 targets 
South Africa at a national level, with sexual mix- 
ing parameters based on data from KwaZulu-Natal 
province. 

Geographic distribution of the population was not 
explicitly modeled in vO.7, but an explicit network of 
human relationships was formed based on pair forma- 
tion rules, and allowed to evolve over time. Though 
capable of modeling homosexual partnerships and 
commercial partnerships (in which providers are dis- 
tinguished from patrons), vO.7 utilized only het- 
erosexual relationships due to their importance in 
the modern sub-Saharan Africa transmission setting 
3]. Thus, HIV transmission could only result from 
heterosexual intercourse or mother-to-child vertical 
transmission in vO.7. Three types of heterosexual re- 
lationships were modeled, each characterized by dif- 
ferent distributions for ages of entrants, durations. 
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Figure 1: The EMOD-HIV model is designed to cleanly separate components related to the contact network 
(Network) from within- host biology of HIV infection (Intrahost). Both of these build upon a generic "Per- 
son" whose properties are more broadly applicable in epidemiological models (e.g., age, gender, and vital 
statistics). This Person object is specialized into "STI Person," containing attributes related to behavior 
within the contact network and potentially applicable to other STIs, and a further specialized "HIV Person" 
object containing attributes that are specific to the biology of HIV. Interventions can modify factors in both 
modules that influence transmission, such as the rates of condom use or use of antiretroviral therapy. In 
tiO.7, each HIV-discordant interaction is modeled explicitly, and the transmission probability is calculated 
based on behavioral factors (type of sexual contact, condom use, etc.) and biological factors (stage of HIV 
disease, co-infections, ART, etc.) after interventions are applied. 



and levels of concurrency. The youngest, shortest- 
duration relationship type was most likely to permit 
the individuals to simultaneously seek additional re- 
lationships. Thus, although the propensity to form 
multiple simultaneous partnerships was not explicitly 
age-dependent, the types of partnerships preferred 
by younger individuals implicitly increase their like- 
lihood to enter concurrent partnerships. 

The remainder of this article describes the EMOD- 
HIV vO.7 model in detail, with subsections for demo- 
graphics, individual properties, the pair formation al- 
gorithm, relationship flow, sexual relationships, sex- 
ual acts, HIV transmission, HIV infection (with and 
without treatment), and the antiretroviral therapy 
(ART) intervention. The article concludes with a 
brief discussion of future work. 

3 Detailed Model Description 
3.1 Model Structure 

The EMOD-HIV model propagates forward in time 
using a combination of discrete events and time steps. 
Discrete events, implemented using C-|--|-ll lambda 



expressions, allow for one or more function calls to 
be scheduled for execution at a future time in the 
simulation. 

As with all the EMOD disease models, the HIV 
model utilizes object-oriented programming design 
principles such as interfaces, factories, and observers 
to achieve a highly modular architecture, thus en- 
abling comparisons of structural assumptions and dif- 
ferent levels of model detail. For example, parameters 
specific to the within-host biology of disease progres- 
sion and to behavioral propensities that drive pair 
formation have been separated into different inher- 
itance classes. Thus, modules or sub-modules can 
be interchanged while leaving other portions of the 
model intact. Figure [l] provides diagram of the archi- 
tecture, the components of which will be discussed in 
detail in the sections that follow. 

3.2 Demographics 

Vital dynamics within the EMOD-HIV model are 
derived from fertility and mortality tables that are 
passed to the model as input. In vO.7, these val- 
ues came from the Actuarial Society of South Africa 
(ASSA) AIDS and Demographic Model (4j, using 
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linear interpolation between age-specific data points 
from ASSA to construct a cumulative probability dis- 
tribution function (CDF) of death date from a per- 
son's date of birth. Upon instantiation, individual 
agents in the model sample stochastically from this 
CDF using an inverse transform of this distribution. 
Female agents similarly sample the age at next child- 
birth, if any, upon instantiation and birth of a pre- 
vious child. Pregnancy is not linked to relationship 
status in vO.7, although newly born individuals are 
linked to a mother. The fertility rate changes by 
simulation year and female age, and the available es- 
timates range from 1980 to 2020. Values outside of 
this range were chosen by "clamping" - i.e., choosing 
the nearest value within the range. Clamping was 
also used when necessary to determine the non-AIDS 
mortality rate, which varies by gender, age, and sim- 
ulation year. 

Mother-to-child transmission (MTCT) is enabled 
in vO.7 using the linkage between the mother and off- 
spring created by the demographics module. The 
basehne rate of MTCT is 1 in 3 births [3 li [7]. 
Antiretroviral therapy (ART) reduces the transmis- 
sion rate to 1 in 100 births [8], except in the final 
9 months prior to death due to ART failure, when 
it is assumed to have no effect on MTCT because 
HIV-related death after ART initiation is assumed to 
occur due to virological failure. (As we will show, 
many individuals do not reach the time of virological 
failure and death on ART because non-HIV mortal- 
ity occurs at an earlier time - i.e., they achieve a 
normal lifespan while on ART.) Additionally, 45% of 
mothers not on ART receive single-dose Nevirapine 
prophylaxis (sdNVP), which reduces the MTCT rate 
to 1 in 10 births f9l[T0l[Il]. 

3.3 Individual properties 

Individuals within the simulation have a variety of 
properties, represented by continuous or discrete 
state variables. Some properties are static through- 
out life, and others dynamically change through the 
course of the simulation either in response to ag- 
ing or simulation events such as infection. Included 
in these properties are complex dynamic objects 
for HIV infection and relationships, to be described 
shortly. Static properties are assigned upon instanti- 
ation (simulation initialization or birth after the be- 
ginning of the simulation) and include gender, time of 
birth, time of non-HIV death, STI co-infection, and 
male circumcision. In vO.7, the latter two are sampled 
randomly at birth with respective probabilities of 7% 
and 42% f3|. Dynamic properties include HIV status, 
history of ART program participation and drop-out 



if applicable, and relationship participation. 

3.4 The Pair Forming Algorithm 

Individuals enter a sexually active state at an age 
randomly sampled from a WeibuU distribution with 
a shape parameter of 20 and a mean age of 16.5 years 
and 15.5 years for males and females, respectively, 
and never permitting sexual activity earlier than 13 
years of age [31 [TH [T31 [TJ . These values were deter- 
mined by calibration, using national survey results as 
a prior [SI [H [H [13 [TB] . Individuals that have ex- 
ceeded their stochastic sexual debut age are able to 
enter into three types of sexual relationships: tran- 
sitory, informal, and marital. Three Boolean static 
properties of individuals, termed "extra-relational 
flags," determine whether an individual can enter ad- 
ditional concurrent relationships while participating 
in a relationship of each of the three types. In vO.7, 
these flags are stochastically assigned at instantiation 
with probabilities of 80%, 30%, and 10% for transi- 
tory, informal, and marital relationships, respectively. 
Note that these flags indicate the ability of an in- 
dividual to participate in concurrent relationships, 
and were determined by calibration, but the resulting 
point prevalence of concurrency ends up close to val- 
ues found in (12^. These extra-relational flags create 
heterogeneities in HIV risk: an individual who re- 
ceives all three flags, for example, would be likely to 
participate in many more lifetime partnerships than 
one who receives no flags. 

The algorithm used to take individuals seeking re- 
lationships and form pairs is called the pair forma- 
tion algorithm (PFA). Each of the three relationship 
types uses the same PFA algorithm, but operates on 
different data about the age distribution and age gaps 
within partnerships. 

PFA takes as input a matrix, called the matrix of 
partnership age distributions (MOPAD), in which the 
entry at position {i,j) is the probability of a male of 
age bin i pairing with a female of age bin j. The three 
MOPADs used in the can be seen in Figure [2ja-c) and 
are based on published data by Ott et al. from the 
Africa Centre for Health and Population Studies [T7] . 
Ott et al. report their data in 5-year age bins; these 
were sub-divided to 2.5-year age bins, shifted to rep- 
resent incident rather than prevalent relationships, 
and smoothed using a Gaussian kernel. Membership 
in an age bin was determined based on each individ- 
ual's time of birth. 

Individuals enter the PFA seeking a relationship, 
and are immediately placed into a gender-specific 
queue, see Figure [3j There is a single queue for 
all males, and for females, there are separate queues 
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Figure 2: Transitory (left column), informal (middle column), and marital (right column) relationships are 
characterized by their matrix of partnership age distributions (MOPAD, top row) and by their WeibuU- 
distributed duration (bottom row; note the difference in scale on the time axis). On average, transitory 
relationships recruit the youngest individuals, create the smallest age gap between partners, and have the 
shortest durations, while marital partnerships recruit the oldest individuals, create the largest age gap 
between partners (approximately 5 years), and have the longest durations. 



for each 2.5-year age bin. The queues fill over a pe- 
riod of time (fourteen days for marriage and one day 
for transitory and informal) , after which relationships 
are formed by a processing step. The processing step 
works linearly through the male queue, starting from 
individual at the head of the queue. A female part- 
ner for the head male is determined based on age 
and availability. More specifically, a male of age bin 
TO samples a partner age bin, /, from 

I otherwise. 

Here, Nf is the number of females queued in age bin / 
and p{F — f\M = m) is the conditional distribution 
derived from the MOPAD. If p{F — /) is identically 
zero, the male remains in the queue for the next pro- 
cessing round. Otherwise, the female of the sampled 
age bin who has been waiting the longest is selected 
as the partner. The paired individuals are removed 
from their respective queues, and the process is re- 
peated. 



3.5 Relationship Flow 

The pair formation algorithm can produce relation- 
ships distributed according to the MOPAD matrix 
only if the males and females entering the algo- 
rithm have age distributions that match the respec- 
tive marginals of the MOPAD, and the total num- 
ber of females matches the total number of males. 
The (exponential) rates at which individuals in each 
age bin and gender enter for each relationship type 
are adjusted daily to form the relationships dictated 
by the MOPAD. The MOPAD dictates only the rel- 
ative number of relationships formed between pairs 
of different ages, but not the absolute number of re- 
lationships formed by the PFA. This total through- 
put of relationships formed is set such that the ex- 
pected number of individuals seeking a relationship 
of a particular type, after rate adjustment to meet 
the MOPAD, matches the number of males and fe- 
males that would have sought that relationship had 
the rates not been adjusted for the MOPAD. These 
unadjusted rates are listed in Table [T] Thus, the 
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Table 1: Base exponential relationship formation 
rates and WeibuU dissolution distribution parame- 
ters. 



Figure 3: A diagram of the Pair Formation Algorithm 
(PFA). Males and females enter from the left. Males 
form a single queue, while females are sorted into 
queues according to their age, represented above as 
labeled arrows. Males and females accumulate in the 
order they entered and remain in their queues until 
the PFA is processed. Processing takes place once per 
day in the transitory and informal PFAs, and once 
every 14 days in the marital PFA. During processing, 
the individual at the head of the male queue selects 
a partner based on his MOPAD-derived conditional 
probability weighting (represented by dashed green 
lines). The paired individuals exit the algorithm to 
the right. Only individuals with no available partners 
of their chosen type are left in queues after processing 
is complete. 



MOPAD adjustment changes the age distributions of 
the individuals seeking relationships, but not the to- 
tal number of each type of relationship formed. 

This adaptive daily rate control allows the model 
to automatically discover the rates of relationship en- 
try that are consistent with the MOPAD. It is con- 
ceivable, however, that events causing large demo- 
graphic shifts might change the MOPAD. For exam- 
ple, when comparing simulations with universal HIV 
treatment versus no treatment, it is conceivable that 
demographic influence of the disparate AIDS death 
tolls should cause the MOPADs to diverge. There- 
fore, adaptive rate control is disabled after an initial 
burn-in period of 15 years, after which the entrance 
rates remain fixed at their final controlled values and 
the MOPAD is permitted to change as the simulation 
progresses. 

The calculation of the total relationship formation 
rate (Table [l]) and adjustment of individual relation- 
ship entry rates are based only on individuals eligible 
to form a new relationship. Eligibility to form a new 
relationship depends on several factors. Individuals 
that have not surpassed their debut age are not eligi- 
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ble for any relationships, and all individuals must be 
at least 15 years old to marry. Post-debut individuals 
who are not already in a relationship are eligible to 
add any of the three relationships. Young individuals 
(e.g. < 20) will tend to enter a transitory relationship 
and older individuals (e.g. > 40) will tend to enter a 
marital relationship due to their different age distri- 
butions shown in Figure [T] 

Once in a relationship, the previously described 
Boolean concurrency flags determine if the individu- 
als is allowed to add additional (concurrent) relation- 
ships, after a waiting period of 60 days. For example, 
if the transitory concurrency flag is true, the individ- 
ual is allowed to add additional relationships of any 
type as long as their only ongoing relationships are 
of transitory type. Coital dilution, discussed later, is 
applied to any concurrent partnerships, and the max- 
imum number of simultaneous relationships restricted 
to three transitory, two informal, and one marital re- 
lationship (s). Note that, under these assumptions, a 
small subset of the population will have all three flags 
true, and comprises an especially high-risk subgroup. 
Note also that in this model, the formation of a mar- 
ital partnership, which has the lowest probability of 
permitting concurrency and the longest duration, will 
often "protect" individuals from taking on additional 
partnerships later in life. 

3.6 Sexual Relationships 

Sexual relationships are instantiated by the three 
PFAs and have characteristics including the partic- 
ipating individuals, condom usage rate, formation 
time, and scheduled breakup time. When the re- 
lationship forms, its breakup time is sampled from 
a PFA-type-dependent WeibuU distribution, see Fig- 
ure [2]Jd-f). The mean and shape parameters of these 
distributions are listed in Table[T] Note that relation- 
ships can terminate early if either partner dies, and 
the surviving partner is treated as eligible, equiva- 
lently to a someone who has become eligible through 
a relationship breakup. 
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Table 2: Sigmoidal condom ramp parameters. 

Low (/) High (h) Middle Year (to) Rate (r) 

Transitory 0.0125 0.6375 1999 0.3 

Informal 0.0125 0.5125 1999 0.25 

Marital 0.000625 0.200625 2002 0.225 



Each sexual relationship has a fixed condom usage 
probability, called the base rate, which is determined 
at the time the relationship is created. The func- 
tional form of the base condom usage probability is 
an increasing sigmoid, 

the parameters of which depend on the type of re- 
lationship (see Table [2|. The relationship type- 
dependence of these parameters is to account for the 
fact that condom usage is lower in longer-term re- 
lationships, while the time-dependence accounts for 
reported increases in condom usage over time. Self- 
reported condom usage probabilities vary greatly. 
The values in Table [2] come from calibrating param- 
eters that were initially determined from data given 
in the 2002, 2005, and 2008 HSRC national surveys 

[SUHIIi]. 

For half of the relationships with a base rate above 
10% and below 90%, a modified rate of either 10% 
or 90% was used to add additional heterogeneity by 
increasing the variance. This was implemented in a 
way that ensured that the expected condom usage 
probability remained equal to the base rate. 

3.7 Coital Acts 

Individual coital acts are simulated for HIV- 
discordant relationships only. When an individual be- 
comes infected, the discordancy states of all relation- 
ships involving the individual are updated. Within 
HIV-discordant relationships, the base time between 
coital acts is exponentially distributed with a mean 
of three days 18 . For individuals in two, three, or 
four-plus concurrent partnerships, our model addi- 
tionally accounts for coital dilution, i.e., a negative 
correlation between the number of simultaneous part- 
nerships and the frequency of coital acts within these 
partnerships [191 EO] . While some studies suggest a 
very strong effect of coital dilution [51] [55], others 
find little evidence for this phenomenon [53J [51] . We 
therefore chose a relatively modest parameterization 
of coital dilution, in which the act frequency per rela- 
tionship for an individual belonging to two, three, or 
four (or more) concurrent relationships is multiplied 



by 0.75, 0.60, and 0.45, respectively. Condom usage is 
assigned to each coital act based on the relationship's 
usage probability, and higher-risk anal intercourse is 
selected 2% of the time. 

3.8 Stages of HIV Infection 

We assume three stages of untreated HIV infection: 
acute, latent, and AIDS. In vO.7, the durations of the 
acute and AIDS stages are 2.9 months and 9 months, 
respectively, for all individuals [5S]. The total sur- 
vival time with HIV is sampled stochastically based 
on the individual's age at the time of infection (dis- 
cussed below), and the duration of the latent phase 
is the total survival time minus the duration of the 
acute and AIDS stages. If the total survival time is 
shorter than 11.9 months, then the AIDS phase is 
shortened, and in rare cases when survival is shorter 
than 2.9 months, the remaining acute phase is short- 
ened to fit within the survival duration. 

3.9 HIV Transmission 

The probability of HIV transmission within a dis- 
cordant relationship is calculated on a per-act basis 
and varies by disease stage. The per-act transmission 
probabilities for the acute, latent, and AIDS stage are 
3AE - 2, 1.3^; - 3, and 9AE - 3, respectively. The 
relative magnitude of these values in each stage of 
disease comes from the per-time transmission rates 
measured in HIV-discordant couples in Uganda ,25i . 
These probabilities have been multiplied by the in- 
verse act frequency and increased by 50% as a result 
of the model calibration process. Note that these 
transmission probabilities apply in the absence of any 
interventions or other cofactors. The per-act trans- 
mission probability is multiplied by applicable act- 
dependent factors, which are described in Table [3] 
There is no gender bias in transmission other than 
the male circumcision cofactor, which reduces the cir- 
cumcised male's HIV acquisition probability by 60% 
[261 [271 [28]. 

If the HIV positive individual is on ART and not 
experiencing treatment failure, then the transmission 
probability per act is set to 4% of the latent-stage 
transmission probability [29]. When failing ART, the 
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Table 3: Cofactors used in determining the HIV transmission probability. 

Cofactor Transmission Probability Multiplier 

Male circumcised (F— )-M transmission only) 0.4 

STI present 4.0 

Condom used 0.1 

Anal intercourse (infected receptive female partner) 4.0 

Anal intercourse (infected insertive male partner) 10 




Figure 4: The untreated HIV prognosis of an individual is determined at the time of infection using this 
age-dependent distribution. Before age 15, a sum of two weighted Weibull distributions is used to represent 
the typically bimodal survival time of children. Beyond age 15, survival time is Weibull-distributed, with 
older individuals having a shorter average survival time. Note that untreated HIV-infected individuals die at 
the end of their HIV prognosis or their normal lifespan (determined by the demographics module), whichever 
occurs earlier. 



transmission probability is set to the AIDS transmis- 
sion probability. 



3.10 Survival with Untreated HIV In- 
fection 

HIV prognosis is both heterogeneous and highly age- 
dependent [301 1311 132] , and is therefore sampled from 
an age-dependent distribution, shown in Figure [4] 
For individuals over the age of 15, the distribution 
is Weibull-distributed with shape parameter k set to 
2 and an age- variable scale parameter A varying with 
age a (in years) as follows [50] : 

A = 21.182 - 0.2717a (3) 
For those under 15, the distribution is a convex 



combination of two Weibull distributions, 

Ai = 1.515 + 1.039a (4) 

Ki 0.97 + 0.0687a (5) 

A2 = 10.0 + 0.474a (6) 

K2 = 5.39 - 0.226a, (7) 

distinguishing infant mortality and later child mor- 
tality [33j. The first distribution is used 65% of the 
time. 

CD4-t- T-cell count is time-dependent based on the 
fraction remaining of the individual's prognosis /, 
calculated by dividing the time since infection by the 
total HIV prognosis. It drops to 594 cells//iL imme- 
diately upon infection to represent very rapid CD4 T- 
cell depletion (and partial rebound) during the acute 
phase (the detailed timecourse of which is not mod- 
eled here), and decreases down to 59 cells//iL at death 
as follows: 

CD4(cells//iL) = (24.363 - 16.672/)^. (8) 
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3.11 HIV Infection (with ART) 

Mortality for individuals receiving ART is determined 
using a multivariate Cox proportional hazards model 
from the International Databases to Evaluate AIDS 
- South Africa [3H , which takes as input the in- 
dividual's age, gender CD4 count, and body weight 
to estimate the prognosis. Note that this database 
utilizes ecological studies of ART efficacy, and thus 
includes "real-world" factors such as poor adherence, 
co-morbidities, misuse of ART, undetected drug re- 
sistance, etc., which may contribute to mortality on 
ART and not be captured in randomized controlled 
trials. 

In addition to the individual's age and gender, the 
WHO stage and body weight are estimated by as- 
suming individuals progress from the pre-infection 
state through WHO stages from one to four stochas- 
tically, with the interval from WHO stage 1 to 2, 
2 to 3, and 3 to 4 samples from a WeibuU distri- 
bution with shape parameters 0.26596, 0.19729, and 
0.34721, respectively, and scale parameters 0.96604, 
0.99160, 0.93560, respectively [36]. Any remaining 
pre-treatment survival time is spent in WHO stage 4. 
Body weight is assumed to decline according to the 
midrange values of the WHO AIDS staging guide- 
lines: from a pre-infection weight of 65 kg to 62.1 kg 
in Stage 1, 57 kg in Stage 2, 50 kg in Stage 3, and 
40.1 kg in Stage 4. Calculation of CD4 decline was 



described earlier (see Section 3.10). 

The survival time, s, for an individual on ART was 
calculated as 



s(years) 



/ 123.83 
\ rn 



2.9 



log ( - 
r 



2.9 



(9) 



in which r is a uniformly distributed random number 
between and 1, and m is the appropriate multi- 
plier applied for the individual based on the condi- 
tions listed in Table |4) Mortality while on ART is 
assumed to be associated with a period of virologi- 
cal failure with the same duration and infectivity as 
the late-stage AIDS phase of untreated disease. For 
a majority of individuals, this period is longer than 
their HIV-independent survival, such that they never 
survive to realize the higher-infectivity phase at the 
end of successful viremic suppression by ART. 

In the event of dropout from the ART program, an 
individual's recovery while on ART is represented by 
an increase in their CD4 count, which occurs inde- 
pendently of their starting CD4 count and saturates 
after three years on ART [371 |M1 [Ml iOl SI] ■ The 
increase is calculated as [57] : 

CD4 increase(cells/^L) = 15.584t - 0.2113^^ (10) 



Condition 

WHO Stage 3 or 4 
Age > 40 years 
Female gender 

CD4 at enrollment 

25-49 
50-99 
100-199 
200-1- 



Multiplier 

2.7142 
1.4309 
0.6775 



0.7497 
0.4258 
0.3068 
0.2563 



Table 4: Multipliers used in survival time calculation. 



in which t represents the time, in months, that the 
individual has received ART. The increase saturates 
after 36 months and does not allow the individual 
to exceed their pre-infection CD4 count. CD4 recon- 
stitution is used to compute the untreated prognosis 
following dropout from the ART program, essentially 
turning back the clock on the infection to the fraction 
of remaining prognosis that corresponds to this CD4 
count. 

When an individual is re-enrolls in an ART pro- 
gram after a prior dropout, the process to re-compute 
the ART prognosis is identical to the initial enroll- 
ment; however, the individual's prognosis parameters 
may change as the individual will have aged in the in- 
tervening time. 

3.12 ART Intervention 

The ART intervention was prescribed by the Mod- 
elling Consortium organizers [1]. CD4 thresholds for 
treatment eligibility are 200 cells//iL, 350 cells//iL, 
and immediate. Once eligible, an individual samples 
an enrollment date from an exponential distribution 
with one-year mean. A fraction of the population is 
not allowed to enroll, as prescribed in the guidelines. 
ART is not distributed preferentially by age, gender, 
or any other criteria than CD4 threshold. Of indi- 
viduals dropping out of the ART program, 50% are 
eligible to re-enroll. See Eaton et al. 2012 for a de- 
tailed discussion of the interventions modeled in the 
study [1]. 

3.13 Timecourse of a Simulation 

The timecourse of each simulation run is depicted 
graphically in Figure [5] Simulation begins in year 
1960 to allow ample time for relationships to "burn- 
in." During the period between 1960 and 1975, the 
relationship formation rates for each gender and rela- 
tionship type are updated daily using the relationship 
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Figure 5: The process of an EMOD-HIV simulation is depicted graphically. After initialization steps such as 
memory allocation and parsing of input files, the simulation begins in 1960. Newly instantiated individuals 
with demographic properties dictated by the demographics module begin forming relationships at rates that 
are adjusted daily based on the MOPAD. After a burn-in period of 15 years, the entrance rate adjustment 
is stopped, and the age-dependent entry rates into each PFA are fixed at their 1975 rates for the rest of 
the simulation. In 1980, HIV infection appears in 5% of the population at random, with their time since 
infection sampled uniformly from their total untreated HIV prognosis. This high initial seeding of infection 
was used to offset the disproportionate number of low-risk infections created by this method of seeding the 
early epidemic. The intervention (ART distribution according to a variety of criteria and coverage levels) 
is applied instantaneously in 2012, and the simulation continues until 2050. Output files from numerous 
simulations are combined for statistical analysis in Matlab. 



fiow algorithm. Adjustment of PFA entry rates is ter- 
minated in 1975, and the rates are fixed at their 1975 
values for the remainder of the simulation. 

Infections are seeded in year 1980. The seeding 
process infects 5% of the population according to 
the age and gender distribution reported in the 2008 
HSRC survey [15] . This prevalence is higher than val- 
ues generally accepted for 1980 in South Africa; how- 
ever, reproducing the historical trend of HIV preva- 
lence would require a model that acknowledges the 
behavior change that has occurred from the 1980's 
until the present day as a result of increasing HIV 
awareness. These include reductions in commercial 
sex work and other high-risk activities, as well as 
increases in protective behavior such as condom us- 
age, male circumcision, and serosorting-like partner 
choice. Since little data is available about behavior 
during the early HIV epidemic, we chose not to model 
these early-epidemic conditions, and therefore initial- 
ized our simulation at artificially high prevalence to 
account for the relatively lower-risk individuals who 
are initially infected in our model. 

The ART intervention is enabled in year 2012. Eli- 
gible individuals enroll in ART according to an expo- 
nential distribution having a mean of one year. The 



ART intervention continues through the conclusion 
of the simulation in year 2050. 

3.14 Model Calibration 

The EMOD-HIV vQ.7 calibration process used data 
from the 2002, 2005, and 2008 HSRC surveys [llll 
[T5] in addition to antenatal prevalence data from 1990 
to 2009 The parameters that were subject to the 
most careful calibration were those to which the epi- 
demic trajectory was highly sensitive, but for which 
available data is insufficient to know their value with 
reasonable certainty. These include the absolute per- 
contact transmission rate, the rates of condom use by 
year in each relationship type, and the probability of 
concurrency in each relationship type. 

As previously mentioned, the calibration process 
used to generate the results presented in Eaton et 
al. 2012 was rudimentary. Parameter values were ini- 
tialized to values reported in field studies, even when 
these values have high uncertainty and likelihood of 
survey bias. Then, a coordinate descent with golden 
section search was used to incrementally adjust pa- 
rameter values, one at a time, so as to minimize a 
squared error loss function. This function penalized 
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differences between tlie simulation output and HSRC 
prevalence data by age and gender, and ANC preva- 
lence as compared to simulated females aged 25 to 30. 
Parameters that are known with the lowest certainty 
were prioritized for adjustment. 

This approach to calibration worked to a limited 
extent, but was unable to resolve co-varying param- 
eters. These parameters were adjusted by hand in 
order to minimize the same loss function. Subse- 
quent versions of the EMOD-HIV model will employ 
a sophisticated calibration process that uses Bayesian 
methods. 

4 Conclusions and future work 

The EMOD-HIV model is a work in progress. As- 
sumptions have been improved and numerous fea- 
tures have been added since the initial Stellenbosch 
Modelling Consortium meeting. One area of improve- 
ment is model calibration. As previously mentioned, 
the model was initially calibrated using a rudimen- 
tary algorithm that required much adjustment by 
hand. We have since developed a Bayesian calibra- 
tion tool based on Incremental Mixture Importance 
Sampling [43 . Improved calibration will allow pa- 
rameter uncertainty to be included in estimates in ad- 
dition to finding appropriate values for key parame- 
ters. Future work will build upon the model described 
here. Features such as increased detail in our repre- 
sentation of STIs and viral-load driven transmission 
are planned for the immediate future. Additionally, 
the model will soon gain the ability to perform spa- 
tial simulation by using the spatial support features 
of the EMOD software implemented for all diseases. 
Migration between geographic locations, represent- 
ing communities, will be built using realistic migra- 
tion patterns. Finally, we plan to analyze a variety 
of interventions such as circumcision, ART treatment 
options, and vaccine candidates. 
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A Parameter Table 



Parameter Table 



Parameter 


Value 


Units 


Comments 






General 




Initial population 


75000 


people 


Individuals uniformly weighted to 








represent full S.A. population. 


Number of burn-in days 


7300 


days 


Days from 1960 to infection seeding 








in 1980. 


Duration of simulation 


32850 


days 


Days from 1960 to end in 2050. 


Initial prevalence fraction 


5 


percent 


Seeding prevalence, high for 1980 but 








gives good 2010 results. 


Circumcision probability 


42 


percent 


Nationally, 42% of men aged 16-55 








years report being circumcised. 


STI probability 


7 


percent 


Approximates fraction of STI (HSV- 








2) infected individuals that would be 








^npn nine 

O i-L O 1 A i. . 


Minimum age for sexual iu- 


13 


years 


Less than 10% have sex before 15 


f era ft 1 on 






11 f^l Sp'X' HpfoTP 1 .'^ p'XT'lnrlpr] 


Coital frequency 


0.33 


acts /day 


Self-reported coital frequency among 








discordant couples in R,akai, Uganda 






Network 




Male mean debut age 


16.5 


years 


Mean debut age for males reported 








as 16.3 [IB]. 


Female mean debut age 


15.5 


years 


Calibrated from mean debut of 17.2 








reported 


Days between adding rela- 


60 


days 


Avoids excessive concurrent relation- 


tionships 






ships, calibrated. 


Coital dilution with 2 part- 


75 


percent 




ners 






Relatively modest parameterization 


Coital dilution with 3 part- 


60 


percent 


based on conflicting data 


ners 






[m [201 [m nana [24]. 


Coital dilution with 4+ 


45 


percent 




partners 








Disable entrance rate ad- 


5475 


days 


Rate adjustment ends five years be- 


justment on sim day 






fore seeding. 


Transitory relationship for- 


0.00137 


formations / day 




mation rate 






Exponentially distributed formation 


Transitory relationship du- 


0.003044 


breakups / day 


and WeibuU distributed breakups 


ration (mean) 






result in reasonable durations. 


Transitory relationship du- 


1.2 


dimensionless 


relationship prevalence, and 


ration (shape) 






cumulative lifetime number of 


Informal relationship for- 


0.001442 


formations / day 


relationships. Adjusted during 


mation rate 






calibration. 


Informal relationship dura- 


0.001522 


breakups/day 





tion (mean) 
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Parameter Table Continued 








Informal relationship dura- 


2.0 


dimensionless 




tion (shape) 








Marital relationship forma- 


9.13E-5 


formations / day 




tion rate 








Marital relationship dura- 


1.37E-4 


breakups/day 




tion (mean) 








Marital relationship dura- 


1.5 


dimensionless 




tion (shape) 








Transitory PFA processing 


1 


day 




interval 






Processing intervals ensure sufficient 


Informal PFA processing 


1 


day 


partner diversity while avoiding 


interval 






excessive delays. 


Marital PFA processing in- 


14 


days 




terval 








Maximum simultaneous 


3 


relationships 




transitories 






Selected during calibration to arrive 


Maximum simultaneous in- 


2 


relationships 


at a 50 lifetime partners max while 


formals 






enabling concurrency. Polygamy is 


Maximum simultaneous 


1 


relationships 


not enabled. 


marriages 








Minimum age for marriage 


15 


years 


Marriage before 15 is unusual in 








South Africa [TB], as reflected in the 








marital MOPAD matrix. 


Extrarelational probability 


oU 


percent 




transitory 






Calibrated to get HSRC and ANC 


Extrarelational probability 


30 


percent 


prevalence to align while avoiding 


miormal 






too many lifetime partners. 


Extrarelational probability 


10 


percent 




marriage 










Sexual Transmission 




Acute prob transmission 


0.034027 


prob / act 


Relative values adapted from analy- 


per act 






sis of transmission among discordant 



couples in Rakai, Uganda 25j; abso- 
lute rates adjusted by 1.5 factor dur- 
ing calibration, and account for act 
frequency of 0.33 acts/day. Values 
consistent with meta-analysis [ilj . 



Latent prob transmission 


0.001307 


prob/ act 




per act 








AIDS prob transmission per 


0.00937 


prob/ act 




act 








ART viral suppression risk 


0.04 


dimensionless 


Based on HPTN052 trial [53] 


multiplier 








Circumcision risk multiplier 


0.4 


dimensionless 


Trials in South Africa, Kenya, and 








Uganda 1261 EH Eg 


Anal sex probability 


2.0 


percent 


2.3% in men and 1.2% in women re- 








ported in US [15] . 
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iT^lldl 1 loK lliUlLipiiCi 


1 n 


LlllllCllolUllltbo 


1.^/0 pel unpioLecLeti lecepiive act 








from a meta-analysis [HI |371 |31j 


STI risk multiplier 


4.0 


dimensionless 


2.58 for those with GUD in Rakai 








analysis [48]; 2-5 fold multiplier for 








HSV infection gH]. 


Condom risk multiplier 


0.1 


dimensionless 


80% reduction in incidence [50] and 








78% reduction found per act [H] 




Vertical Transmission 


Usage of sdNVP 


45 


percent /birth 


Adherence of 57.1% in Zimbabwe 








[52j: 75% adherence in S Africa (55] 


Mother to child transmis- 


10 


percent /birth 


SWEN trial risk in single dose group 


sion on sdNVP 






8.98% at 6 months with single dose 








NVP; HIVNET 012 11.8% and 15.7% 








at 6 week and 18 mo [51 [TOl [TT]. 


Mother to child transmis- 


30 


percent /birth 


30% (range 14-48%) listed in [51111 [3 


sion no intervention 






54. 


Mother to child transmis- 


3 


percent /birth 


2.9% at 203 days (excludes infants 


Sion on Arti 






born HIV+) [55j 2.5% at birth and 








7.0% at 24 mo in Kisumu breastfeed- 








ing study [56|. See also [5]. 






ART Intervention 




Enable ART 


T or F 




Varies across intervention scenarios, 








taise (1 j tor countertactual. 


Year of ART initiation 


2012 




Given by intervention scenario. 


Exponential ART dropout 


0.000263 


Varies across intervention scenarios. 


rate 








Allow ART re-enrollment 


T 




Re-enrollment is enabled. 


rroDaoiniy ox ajxi re- 


50 


percent 


Independent probabilities for each re- 


enrollment 






eniomneiiL eveiiL. 






Condom Sigmoid 




Condom low transitory 


1.3 


percent/act 




Condom high transitory 


64 


percent/act 




Condom mid year transi- 


1999 


year 




tory 








Condom rate transitory 


0.3 


1 /year 




Condom low informal 


1.3 


percent/act 




Condom high informal 


51 


percent/act 


Values calibrated based loosely on 


Condom mid year informal 


1999 


year 


condom usage data reported in 


Condom rate informal 


0.25 


1 /year 


national surveys. 


Condom low marriage 


6.25E-2 


percent / act 




Condom high marriage 


20 


percent/act 




Condom mid year marriage 


2002 


year 




Condom rate marriage 


0.225 


1 /year 
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